Non-linear Cost-sensitive Decision Tree for Multi-classification
نویسندگان
چکیده
The motivation of this paper is based on a hypothesis that non-linear decision nodes provide a better classification performance than axis-parallel decision nodes do in many practical problems, such as image classification, and voice classification. The algorithm – MNCS_DT is introduced in this paper to create non-linear splits nodes by novel discriminant analysis in decision tree for multi-classification problem and take costsensitive problem into account when the features are selected. In experiment part, we use four UCI data sets to compare the performance of MNCS_DT and C4.5 CS by costs and error rates. The performance of MNCS_DT is better than C4.5 CS. And eight data sets from UCI are used to compare the performance of three different feature sets measured by accuracy, G-mean, and operation time. The performance of feature set consisting of features that follow multivariate normal distribution and altered information gain values higher than average one is better than two other feature sets in most data sets.
منابع مشابه
Ensemble Classification and Extended Feature Selection for Credit Card Fraud Detection
Due to the rise of technology, the possibility of fraud in different areas such as banking has been increased. Credit card fraud is a crucial problem in banking and its danger is over increasing. This paper proposes an advanced data mining method, considering both feature selection and decision cost for accuracy enhancement of credit card fraud detection. After selecting the best and most effec...
متن کاملمقایسه روشهای مختلف یادگیری ماشین در تشخیص پرفشاری خون در بیماران دیابتی با و بدون در نظر گرفتن هزینهها
Background and Objectives: Diabetic patients are always at risk of hypertension. In this paper, the main goal was to design a native cost sensitive model for the diagnosis of hypertension among diabetics considering the prior probabilities. Methods: In this paper, we tried to design a cost sensitive model for the diagnosis of hypertension in diabetic patients, considering the distribution of...
متن کاملA New Algorithm for Optimization of Fuzzy Decision Tree in Data Mining
Decision-tree algorithms provide one of the most popular methodologies for symbolic knowledge acquisition. The resulting knowledge, a symbolic decision tree along with a simple inference mechanism, has been praised for comprehensibility. The most comprehensible decision trees have been designed for perfect symbolic data. Classical crisp decision trees (DT) are widely applied to classification t...
متن کاملInducing Cost-Sensitive Non-Linear Decision Trees
This paper presents a new decision tree learning algorithm that takes account of costs of misclassification. The algorithm is based on the hypothesis that non-linear decision nodes provide a better basis for cost-sensitive induction than axis-parallel decision nodes and utilizes discriminant analysis to construct non-linear cost-sensitive decision trees. The performance of the algorithm is eval...
متن کاملConstructing Cost Sensitive Decision Trees Based on Multi-Objective Optimization
We propose a multi-objective optimization based on the cost sensitive decision tree building method. The misclassification cost, test cost, waiting time cost and information gain rate as four optimization goals by using the method of linear weighting are adopted to transfer the multiobjective optimization problem into a single objective optimization problem, as the splitting attribute selection...
متن کامل